Online Learning and Blackwell Approachability in Quitting Games

نویسندگان

János Flesch

Rida Laraki

Vianney Perchet

چکیده

We consider the sequential decision problem known as regret minimization, or more precisely its generalization to the vectorial or multi-criteria setup called Blackwell approachability. We assume that Nature, the decision maker, or both, might have some quitting (or terminating) actions so that the stream of payoffs is constant whenever they are chosen. We call those environments “quitting games”. We characterize convex target sets C that are Blackwell approachable, in the sense that the decision maker has a policy ensuring that the expected average vector payoff converges to C at some given horizon known in advance. Moreover, we also compare these results to the cases where the horizon is not known and show that, unlike in standard online learning literature, the necessary or sufficient conditions for the anytime version of this problem are drastically different than those for the fixed horizon.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approachability of convex sets in generalized quitting games

We consider Blackwell approachability, a very powerful and geometric tool ingame theory, used for example to design strategies of the uninformed player inrepeated games with incomplete information. We extend this theory to “generalizedquitting games”, a class of repeated stochastic games in which each player may havequitting actions, such as the Big-Match. We provide three simpl...

متن کامل

Blackwell Approachability and No-Regret Learning are Equivalent

We consider the celebrated Blackwell Approachability Theorem for two-player games with vector payoffs. Blackwell himself previously showed that the theorem implies the existence of a “noregret” algorithm for a simple online learning problem. We show that this relationship is in fact much stronger, that Blackwell’s result is equivalent to, in a very strong sense, the problem of regret minimizati...

متن کامل

Approachability in Stackelberg Stochastic Games with Vector Costs

The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optim...

متن کامل

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games

متن کامل

A Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games

The notion of approachability was introduced by Blackwell ([8]) in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector-cost of a given player towards a given target set, irrespective of the strategies of the other players. In this paper, motivated from the multi-objective...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Online Learning and Blackwell Approachability in Quitting Games

نویسندگان

چکیده

منابع مشابه

Approachability of convex sets in generalized quitting games

Blackwell Approachability and No-Regret Learning are Equivalent

Approachability in Stackelberg Stochastic Games with Vector Costs

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games

A Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games

عنوان ژورنال:

اشتراک گذاری